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Abstract 



The Minority Game is a simple yet highly non-trivial agent-based model 
for a complex adaptive system. Despite its importance, a quantitative ex- 
planation of the game's fluctuations which applies over the entire parameter 
range of interest has so far been lacking. We provide such a quantitative 
description based on the interplay between crowds of like-minded agents and 
their anti-correlated partners (anticrowds) . 
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Agent-based models of complex adaptive systems are attracting significant interest across 
a variety of disciplines |L|. An important application currently receiving much attention 
within the physics community, is the study of fluctuations in financial markets ||. Each 
agent knows the past ups and downs in the price index of a financial market and must decide 
how to trade based on this global information. These decisions then feed back to generate 
the fluctuations or 'volatility' of the price index itself. The resulting dynamical, many-body 
system is hence highly non-linear; indeed the combination of adaptability and frustration 
make such agent-based systems arguably richer and more challenging than those typically 
studied in condensed matter and statistical physics. 

The Minority Game (MG) introduced by Challet and Zhang offers possibly the 

simplest paradigm for such a complex, adaptive system. The development of a complete, 
quantitative theory of the MG is of fundamental importance - the MG has therefore been the 
subject of intense research activity in the physics community p|-|10|j. The most striking fea- 



ture arising from numerical simulations [|J is the non-monotonic variation in the 'volatility' 
a (i.e. the standard deviation of the fluctuations) of the 'price' as the agents' memory-length 
m is varied. Challet et al provided a sophisticated formal connection between the MG and 
spin glass systems yielding many fascinating quantitative results for the MG's dynamics. 
However, no general theory has yet been provided which yields quantitative agreement with 
the numerical results for a Q over the entire range of m, s and N values. 

This paper shows that a theoretical model can be constructed in a surprisingly simple way 
by incorporating the 'crowd' effects (i.e. strong inter-agent correlations) which arise within 
the interacting, many-agent population. The results yield very good agreement with the 
numerical results for a over the entire range of m, s and N. The non-monotonic behaviour 
of the volatility a [|3],[4| results from a fascinating interplay between the actions of groups of 
like-minded agents ('crowds') and the actions of opposing groups ('anticrowds'). 

The MG || comprises an odd number of agents N who choose repeatedly between option 
(e.g. buy) and option 1 (e.g. sell). The winners are those in the minority group, e.g. sellers 
win if there is an excess of buyers. The outcome at each timestep represents the winning 



decision, or 1. A common bit-string of the m most recent outcomes is made available 
to the agents at each timestep. The agents randomly pick s strategies at the beginning of 
the game, with repetitions allowed. After each turn, the agent assigns one (virtual) point 
to each of his strategies which would have predicted the correct outcome. At each turn of 
the game, the agent uses the most successful strategy, i.e. the one with the most virtual 
points, among his s strategies. The strategy-space V m forms a 2 m -dimensional hypercube 
with strategies at the 2 2 " 1 vertices. Fortunately, the MG's standard deviation is essentially 
unchanged if a 'reduced' strategy space U m is used instead of V m !3 : the U m only contains 
2 m+1 strategies or equivalently 2 m strategy pairs {Q}- The two strategies within a given 
pair Q are anticorrelated, i.e. they differ by the maximum possible Hamming distance 
d H = 2 m [Q. Strategies between any two pairs Q and Q' are uncorrelated, i.e. they differ by 
d H = 2 m ~ 1 . The results presented in this paper employ the reduced strategy space. 

If tir agents use the same strategy R, then they will act as a 'crowd', i.e. they will make 
the same decision. If agents simultaneously use the anticorrelated strategy R, they will 
make the opposite decision and will hence act as an 'anticrowd' (Q = (R,R)). If ~ 
for all Q, then the actions of the crowds and anticrowds cancel and the standard deviation o 
of the number of agents Ait) choosing a given option will be small. In contrast if 3> 
for all Q, then a will be large. Since there is no correlation between Q and Q', each group 
Q comprising a crowd-anticrowd pair (riR, n^) can be taken as contributing to A(t) via a 
separate, essentially random walk in time of step-size \riR — n^\. The variances of these 
walks can be summed to obtain the standard deviation of A(t). Hence the MG standard 
deviation is given by 
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where both time-averaging, for a given configuration of initial strategies, and configuration- 
averaging have been carried out. We now demonstrate that this crowd-anticrowd cancella- 
tion underlies the numerical results for a vs. m |. We run the numerical simulation of 
the MG and wait until transients in A(t) have disappeared. At timestep to, we read out the 
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number of players playing each strategy R, where i? = 1,2,... 2 m+1 . For each strategy pair 
Q = (R,R), we obtain — at time to an d hence calculate a. We then average this a 
over 1000 timesteps to simulate the time-averaging (our results are insensitive to the precise 
time- averaging procedure). Finally, we average over 16 runs to simulate the configuration- 
averaging. Figure 1 compares the resulting time and configuration-averaged a with that 
obtained from the numerical MG simulation. The agreement is very good for all m, s and iV 
(not shown). Hence the crowd- ant icrowd cancellation quantitatively explains the numerical 
results for a ||. 

Since the labels R are arbitrary in Eq. (1), the ordering of strategies {tir} has no 
particular significance. At any particular time to, these 2 m+1 strategies can be ranked 
according to their virtual points by a sort-operation acting on the list {ur}. Hence {tir} h+ 
{n p } where p is the virtual-point rank label with p = 1 being the highest scoring strategy 
and {n p } = n p= \, n p= 2, . . . n p=2 m+i. Alternatively, strategies can be ranked according to 
their popularity {n r } where {n r } = e. strategy r = 1 is that used 

by the largest number of agents, strategy r = 2 is that used by the second largest number of 
agents, etc. Note that {n r } can be obtained from {n R } and hence {n p } by sort operations, 
i.e. {nji} i— > {n r } and hence {n p } i— > {n r }. Since each agent plays the available strategy 
having highest virtual points, it follows that 



produce discrete steps in the curves of function of r, particularly for large m where 

the total number of strategies 2 m+1 exceeds N. Hence we convert the theoretical n r values 
of Eq. (2) to an integer. For large m such that 2 m+1 ^> N, the resulting theoretical values 
are typically n r ~ 1 for small r and n r = for r > N. We have checked that the theoretical 
and numerical values for n r agree for general m. 

For the virtual-point ordered list {n p }, the strategy p' = 2 m+1 + 1 — p is always anticor- 
related to the strategy p, i.e. p' = p. Hence knowledge of the sort operation T completely 
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n r = N as required. Agents are discrete objects; hence the simulation tends to 
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determines where each strategy's anticorrelated partner is located in the popularity-ordered 
list {n r }. Since we are here only interested in time-averaged and run-averaged a, we only 
need to consider the probability distribution of locations of f in the popularity-ordered list 
{n r }. We therefore replace the sort operation T by a probability function P(r' = f) which 
gives the probability that any strategy r' is the anti-correlated partner of strategy r in the 
list {n r }. Hence Eq. (1) becomes 
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When the virtual-point ordered list {n p } and the popularity-ordered list {n r } are identical, 
then P(r' = f) will be a 5-function at r' = 2 m+1 + 1 — r and hence Eq. (3) has the same 
form as Eq. (1). In the opposite case where the two ordered lists are uncorrelated, P(r' = f) 
should be a flat distribution. 

Figure 2 shows P(r' = f) for r = 1 as a function of r', taken from the numerical MG 
simulation at m = 2, 5 and 10. For small m (m = 2) the anticorrelated strategy to the most 
popular strategy (i.e. r = 1) is at r' = 2 m+1 , i.e. it is the least popular strategy. Hence 
P(r' = f) resembles the 5-function limiting case mentioned above. Very few agents will 
therefore play this anticorrelated strategy. Hence the crowd-anticrowd cancellation will be 
small and a will be large, as can be seen in Fig. 1. As m increases (m = 5) a remarkable 
effect occurs: the peak in P(r' = f) moves up toward r = 1. Hence both r = 1 and its 
anticorrelated partner f are now very popular. For m = 2 it seemed like there was an effective 
'repulsion' between r and f; for m = 5 this seems more like an attraction yielding a bound 
'exciton-like' pair with the crowd (anticrowd) playing the role of the positive (negative) 
charge. For large m (m = 10), the ability of the anticrowd to 'screen' the crowd has 
decreased yielding a rather flat distribution. The strong crowd-anticrowd correlation which 
appears as m increases means that the crowd and anticrowd become comparable in size. 
Hence a is small for m ~ 5 — 6, in agreement with Fig. 1. Note that the MG cannot fully 
'optimize' itself by building equal-sized crowds and anticrowds because of the strategy-space 
frustration built in at the beginning of the game. In modified MG models such as Ref. |[Tl| 



which features evolving stochastic strategies, this frustration is able to relax - similar sized 
crowds and anti-crowds hence emerge yielding a smaller-than-random a for general to. 

For small to, the two lists {n p } and {n r } will be very similar, hence P(r' = f) ~ 
5 r / j2 m+i + i_ r . The discreteness of the agents is now unimportant since n r ^> 1, hence n r can 
be treated as continuous. Equations (2) and (3) yield 
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For s = 2 this becomes 
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The inset in Fig. 3 shows these analytic curves for s = 2 and s = 4 (thick solid lines 
monotonically decreasing). As might be expected using the extreme 5-function form for 
p[r' = f), these curves produce underestimates of a for small to. Now consider the opposite 
extreme of uncorrelated r' and f, i.e. the flat distribution P(r' — f) ~ 2~ (m+1 ). For s = 2 
this gives 
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Equation (6), and its generalization for s > 2, produce /overestimates (thin dashed lines 
in Fig. 3 inset). For a given s, the cr values obtained from separate numerical runs tend to 
be scattered between the two corresponding limiting analytic curves. For larger to (to > 6) 
the discreteness of the agents implies that n r ~ 1 for r < A" while n r = for r > A/". Using 
the integer form of Eq. (2) for high to, and the flat distribution for P{r' = f), yields 
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Hence a approaches the coin-toss limit from below as m — > oo (thick solid line monotonically 
increasing in Fig. 3 inset) as observed numerically. The analytic curves in Fig. 3 (inset) 
have further similarities with the numerical results of Fig. 1: (a) a minimum in a for 
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s = 2, (b) minimum moves to higher mass increases, (c) minimum becomes shallower as s 
increases, (d) a is not sensitive to s for large m. Equation (7) also shows that (Jhigh m ~~ > 
at iV = 2 m+1 , i.e. for m ~ 5 — 6. Using conditional probabilities to go one step beyond the 
approximation of a flat P{r' = f) distribution, we find that <7hi g h m f° r general s is given by 
y/~N[l — 2~( m+1 ' ) (Ns — l)]s/2. Interestingly, this form is consistent with the results of the 
spin-glass replica approach |i2|Jl3| . 

Figure 3 (main graph) shows the spread of numerical values for different runs compared 
to the full crowd- ant icrowd theoretical calculation. The appropriate analytic expressions for 
the probability function P(r' = f) in Eq. (3) involve multiple sums and are cumbersome: we 
therefore obtained the results for each m in Fig. 3 by generating the corresponding P(r' = f) 
similar to those in Fig. 2. The agreement is very good. The theoretical points tend to lie 
toward the high end of the numerical spread, for example at m = 2; this can be attributed 
to the fact that our theory neglects accidental degeneracies in the virtual-point ordered list 
{n p }. We have checked that including a stochastic (i.e. coin-toss) process to break such ties, 
reduces the theoretical o values down toward the mid-point of the numerical spread hence 
making the agreement even better [TB[ . 

In summary we have presented an analytic analysis of crowding effects in the MG which 
provides quantitative agreement with the main finding of Ref. [§J across the entire parameter 
range of interest. 

We thank D. Challet for his comments. 
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FIGURES 

FIG. 1. Standard deviation a for the Minority Game as a function of memory-size m for 
s = 2,3,4 strategies per agent and N = 101 agents. Solid curve: numerical simulation. Dashed 
curve: crowd-anticrowd theory using Eq. (1). Random (coin-toss) limit a = y/N/2 = 5.0 is 
indicated. 

FIG. 2. Probability function P(r' = f) giving the probability that the strategy ranked r' on 
the popularity-ordered list, is anti-correlated with the strategy ranked r. Results are shown for 
r = 1 (i.e. most popular strategy) as a function of r' for m = 2 (dotted-dashed), m = 5 (dotted) 
and m = 10 (solid), s = 2 and N = 101. Note that £ r , P(r' = f) = 1. 

FIG. 3. Theoretical crowd-anticrowd calculation (solid circles) vs. numerical simulations (open 
circles) for s = 2, N = 101. 16 numerical runs are shown for each m. Inset: Theoretical curves for 
a using limiting analytic forms in Eqs. (4)-(7). 
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